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ABSTRACT 

Instead of a classical single-stranded deoxyri- 
bonuleic acid (DNA)-binding protein (SSB), some hy- 
perthermophilic crenarchaea harbor a non-canonical 
SSB termed ThermoDBP. Two related but poorly 
characterized groups of proteins, which share the 
ThermoDBP N-terminal DNA-binding domain, have a 
broader phylogenetic distribution and co-exist with 
ThermoDBPs and/or other SSBs. We have investi- 
gated the nucleic acid binding properties and crys- 
tal structures of representatives of these groups 
of ThermoDBP-related proteins (ThermoDBP-RPs) 1 
and 2. ThermoDBP-RP 1 and 2 oligomerize by differ- 
ent mechanisms and only ThermoDBP-RP2 exhibits 
strong single-stranded DNA affinity in vitro. A crystal 
structure of ThermoDBP-RP2 in complex with DNA 
reveals how the NTD common to ThermoDBPs and 
ThermoDBP-RPs can contact the nucleic acid in a 
manner that allows a symmetric homotetrameric pro- 
tein complex to bind single-stranded DNA molecules 
asymmetrically. While single-stranded DNA wraps 
around the surface or binds along channels of pre- 
viously investigated SSBs, it traverses an internal, 
intersubunit tunnel system of a ThermoDBP-RP2 
tetramer. Our results indicate that some archaea have 
acquired special SSBs for genome maintenance in 
particularly challenging environments. 



INTRODUCTION 

Deoxyribonucleic acid (DNA) replication, repair, recom- 
bination and telomere maintenance require the transient 
unwinding of duplex DNA (1,2). To maintain and protect 
DNA in the single-stranded (ss) state, all organisms harbor 
essential single-stranded DNA-binding proteins (SSBs) that 
bind ssDNA with high affinity and low sequence specificity. 
Classical SSBs harbor one of four distinct DNA-binding 
domains: the oligonucleotide/oligosaccharide-binding 
(OB) fold; the K homology (KH) domain; the ribonu- 
cleic acid (RNA) recognition motif (RRM) or the whirly 
domain (3). The sequences of the proteins are poorly 
conserved in either group (4) and different SSBs adopt a 
variety of oligomeric states (5), thereby bringing several 
ssDNA-binding domains together (6). 

Recently, a group of hyperthermophilic crenarchaea has 
been found to lack a classical SSB and instead to contain 
a distinct SSB, termed ThermoDBP (7). The crystal struc- 
ture of the ssDNA-binding N-terminal domain (NTD) of 
Thermoproteus tenax (tte) ThermoDBP, comprising a four- 
stranded (3 -sheet packed against four a-helices (7), differs 
markedly from the classical SSB DNA-binding domains, 
and r^^ThermoDBP was predicted to dimerize in a paral- 
lel fashion via a C-terminal leucine zipper (7). However, the 
exact mode of oligomerization and the mechanism of DNA 
binding of ThermoDBPs so far remained elusive. 

ThermoDBPs share sequence similarity with domain 
of unknown function (DUF) 2258 proteins, which have 
a broader phylogenetic distribution in archaea (Thermo- 
proteales, Sulfolobales, Desulfurococcales, Thermococci 3.nd 
Archaeoglobi). DUF2258 proteins can be divided into 
two groups that we term ThermoDBP-related proteins 
(ThermoDBP-RPs) 1 and 2. Both groups contain a 
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ThermoDBP-like NTD but differ from ThermoDBPs and 
from each other in their C-terminal regions (Supple- 
mentary Figure SI). The ThermoDBP-hke NTD sug- 
gests that ThermoDBP-RPs might also be SSBs. However, 
^f^ThermoDBP-RPl was not recovered from cell ly sates 
via a biotinylated oligodeoxynucleotide used for isolation 
of fr^ThermoDBP (7). Furthermore, Sulfolobus solfataricus 
{sso) ThermoDBP-RPl was reported to associate with box 
C/D small (s) RNAs and with the SOS ribosomal subunit 
(8), but the protein was not detected in a subsequent pro- 
teomic characterization of sso ribosomes (9). We therefore 
set out to further characterize the structures and functions 
of ThermoDBP-RPs. 

Our analyses show that ThermoDBP-RP2s are uncon- 
ventional SSBs that bind ssDNA in an asymmetric fashion 
at an internal, intersubunit tunnel system of a symmetric 
protein tetramer. The apo-structure of ThermoDBP-RPl 
suggests that DNA binding by this group of proteins may be 
auto-inhibited, explaining their weak nucleic acid affinities 
observed in vitro. Structural comparisons reveal that the eu- 
karyotic DNA polymerase eta has acquired a domain that 
closely resembles the N-terminal ssDNA-binding domain 
of ThermoDBPs and ThermoDBP-RPs but employs a dif- 
ferent molecular surface for binding to double-stranded (ds) 
DNA. 

MATERIALS AND METHODS 

Protein production 

Coding sequences of Pyrococcus furiosus ipfu) 
ThermoDBP-RPl and Aeropyrum pernix (ape) 
ThermoDBP-RP2 were polymerase chain reaction (PCR)- 
amplified from P. furious and A. pernix genomic DNA. 
The PGR fragments encoding full-length ThermoDBP-RP 
genes were cloned into pET-Mll (EMBL, Heidelberg) 
via Ncol/BamHI restriction sites in frame with an N- 
terminal His6-tag followed by a Tobacco Etch Virus (TEV) 
protease cleavage site. For expression, cells were grown 
in auto-inducing medium (10) at 37°C to an OD600 of 
0.8, transferred to 18°C and harvested after 60 h. Cell 
pellets were resuspended in 20 mM HEPES-NaOH, pH 
7.5, 500 mM NaCl and 5 mM (B-mercaptoethanol sup- 
plemented with 10 |xg/ml DNasel, 3 mM MgCl2 and 
100 |xg/ml lysozyme. Cells were lyzed by four passes 
through a microfluidizer at 80 kPsi. The cell ly sates were 
clarified by centrifugation. For purification, clarified 
ly sates were loaded onto Ni^^-NTA resin equilibrated 
with lysis buffer. Beads were washed with one bed vol- 
ume of 20 mM HEPES-NaOH, pH 7.5, 2 M LiCl and 
5 mM p-mercaptoethanol followed by washing with 5 
bed volumes of 20 mM HEPES-NaOH, pH 7.5, 500 mM 
NaCl, 20 mM imidazole and 5 mM (B-mercaptoethanol. 
Proteins were eluted with 20 mM HEPES-NaOH, pH 
7.5, 200 mM NaCl, 5 mM p-mercaptoethanol and 300 
mM imidazole. Eluates were mixed with TEV protease 
(mass ratio protease/protein 1:50) and dialyzed overnight 
at 4°C against 20 mM HEPES-NaOH, pH 7.5, 100 mM 
NaCl and 2 mM (B-mercaptoethanol. The dialyzed samples 
were again loaded on Ni^"^-NTA resin equilibrated with 
dialysis buffer to remove the TEV protease and uncleaved 
protein. The flow-throughs were further purified by ion 



exchange chromatography. Samples were loaded on a 
MonoQ column (GE Healthcare) equilibrated with dialysis 
buffer. Proteins were eluted with a linear gradient from 
100 mM to 500 mM NaCl over 50 column volumes. 
Fractions containing the proteins of interest were loaded 
on Superdex 75 (/?/wThermoDBP-RPl) or Superdex 200 
(a/7^ThermoDBP-RP2) size-exclusion chromatography 
columns (GE Healthcare) pre-equilibrated with 10 mM 
Tris-HCl, pH 7.6, 50 mM NaCl and 1 mM DTT. Fractions 
containing pure protein were pooled and concentrated 
to 20 mg/ml using Amicon concentrators (Millipore). To 
prepare selenomethionine (SeMet)-labeled proteins, E. 
coli BL21(DE3) cells were grown in SeMet auto-inducing 
medium (10). The other steps for expression and protein 
purification were the same as described above. 

Electrophoretic gel mobility shift assays 

Box C/D small RNAs (sRNAs) were synthesized by in vitro 
transcription using T7 RNA polymerase and PCR ampli- 
fied templates from P. furious genomic DNA. Transcription 
products were purified on a 6% urea-TBE denaturing poly- 
acrylamide gel followed by phenol-chloroform extraction 
and ethanol precipitation. All other oligonucleotides (Sup- 
plementary Table SI) were chemically synthesized (Dhar- 
macon and MWG/Operon). To prepare DNA or RNA du- 
plexes, complementary strands were mixed in equimolar ra- 
tios, heated to 80°C and slowly cooled (0.2°C/min) in a 
thermocycler to 20° C. The samples were then separated on 
agarose gels and duplex bands were extracted and ethanol 
precipitated. For electrophoretic gel mobility shift assays, 
oligonucleotides were 5^-end labeled with 7-[^^P]-ATP. La- 
beled oligonucleotides were mixed with recombinant pro- 
teins in 20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 2 mM 
DTT and incubated at 20°C or 80°C for 10 min. To com- 
pete RNA binding, 0.5 g/1 E. coli transferRNAs (tRNAs) 
were added to the reactions. Samples were then fraction- 
ated on a 6% (60: 1) polyacrylamide gel and visualized using 
a Phosphorlmager (Typhoon 8600, GE Healthcare). Bind- 
ing of ap^ThermoDBP-RP2 to single-stranded and circular 
Ml 3 phage DNA was tested on a 1% agarose gel. Increas- 
ing concentrations of the protein were mixed with 100 ng 
DNA and incubated at 4°C for 1 h. DNA was visualized by 
ethidium bromide staining. 

Ribosome binding assays 

Ribosomes and ribosomal subunits from Thermococcus ko- 
dakarensis were prepared as described (9). In a total re- 
action volume of 150 |xl, 50 pmoles of 30S, 50S or 70S 
T. kodakarensis ribosomal subunits or ribosomes were in- 
cubated with 400 pmoles of His6 -tagged /7/wThermoDBP- 
RPl or a/?^ThermoDBP-RP2 in a buffer containing 10 
mM Tris-HCl, pH 7.5, 10 mM MgCls, 1 mM DTT, 45 
mM NH4CI and the varying concentrations of NaCl. The 
mixture was incubated for 20 min at 37°C and was then 
placed on 90 fxl of a 20% sucrose cushion. The cushion 
was centrifuged for 30 min at 78 000 rpm in a TLA 100 
rotor. After centrifugation the supernatant was removed 
and the pellet was resuspended. The input sample before 
centrifugation, the supernatant and the pellet were ana- 
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lyzed by sodium dodecyl sulphate-polyacrylamide gel elec- 
trophoresis followed by Western blotting. The membranes 
were stained by amido black to check the protein content in 
each lane and were then treated by anti-His6 antibody for 
detection of ThermoDBP-RPs. 

Isothermal titration calorimetry 

Isothermal titration calorimetry (ITC) experiments were 
performed at 25° C on an iTC200 microcalorimeter 
equipped with a 300- |xl syringe (Microcal, GE Healthcare). 
DNA oligonucleotides and (2/7^ThermoDBP-RP2 were dial- 
ysed against the same buffer overnight. op^ThermoDBP- 
RP2 was titrated at 10 |jlM concentration in a cell volume of 
250 |jl1 with 50-100 fxM of different DNA oHgonucleotides 
in 16 injections of 2.5 |jl1 volumes with 5-min intervals. The 
released heat was obtained by integrating the calorimetric 
output curves. Binding parameters were calculated using 
the Origin5 software using the 'One Set of Sites' curve 
fitting model provided by the software. 

Analytical size-exclusion chromatography 

Analytical size-exclusion chromatography was conducted 
on Superdex 75 (/?/wThermoDBP-RPl) or Superdex 200 
((2/7eThermoDBP-RP2) PC 3.2/30 analytical size-exclusion 
columns (GE Healthcare) in 20 mM HEPES-NaOH, pH 
7.5, 100 mM NaCl. Exclusion volumes (Vq) were estimated 
using Blue Dextran (approximate molecular mass 2 MDa). 
Bovine 7 -globulin (158 kDa), rabbit lactate dehydrogenase 
(140 kDa), chicken conalbumin (75 kDa), bovine serum al- 
bumin (67 kDa), chicken ovalbumin (45 kDa) and horse 
myoglobin (17 kDa) were used as molecular mass stan- 
dards. Linear regression of plots of log (molecular mass) 
versus the ratios of the observed elution and exclusion vol- 
umes (Ve/Vo) allowed calculation of the apparent molec- 
ular masses of /?/i/ThermoDBP-RPl and op^ThermoDBP- 
RP2 based on the observed elution volumes of the proteins. 

Crystallization of ThermoDBP-RPs and of an 
a/?^ThermoDBP-RP2-dTio complex 

Crystallization of />/wThermoDBP-RPl was carried out 
at 20°C using the sitting drop vapor diffusion method. 
/?/t/ThermoDBP-RPl crystallized in two different crystal 
forms. Form 1 crystals of SeMet-labeled /7/wThermoDBP- 
RPl were obtained by mixing 1 \x\ of SeMet-labeled pro- 
tein solution with 1 \x\ of reservoir solution containing 50 
mM sodium cacodylate, pH 6.4, 2.3 M (NH4)2S04 and 10 
mM MgS04. Prior to flash-cooling in liquid nitrogen, form 
1 crystals of />/wThermoDBP-RPl were soaked in reservoir 
buffer supplemented with 25% (v/v) glycerol. Form 2 crys- 
tals of SeMet-labeled ;7/wThermoDBP-RPl were obtained 
by mixing 0.5 |jl1 of protein solution with 1 |jl1 of reservoir 
solution containing O.l M Tris-HCl, pH 8.5, 0.2 M Li2S04 
and 40% (v/v) PEG400. These crystals were flash-cooled in 
a 100 K cryogenic stream in their mother liquor. Crystals of 
a/?eThermoDBP-RP2 were produced by mixing equal vol- 
umes of SeMet-labeled ap^ThermoDBP-RP2 with a reser- 
voir solution containing 0.1 M imidazole, pH 8.0, 0.4 M 
NaH2P04, 1 .6 M K2HPO4, 0.2 M NaCl and 0.25 M glycine. 



Crystals were cryo-protected by addition of 25% (v/v) glyc- 
erol to the mother liquor. 

For co-crystallization with ssDNA, ap^ThermoDBP- 
RP2 (17 mg/ml) was mixed with a 10-fold molar excess of 
dTio in 10 mM Tris-HCl, pH 7.6, 50 mM NaCl and 1 mM 
DTT. The mixture was incubated for 5 min at 80° C. Crys- 
tallization experiments were performed in a sitting drop for- 
mat at 18°C. Crystals appeared after three days over a reser- 
voir solution composed of 200 mM NaOAc, pH 4.3, 1.3 M 
NaH2P04 and 0.7 M K2HPO4. For cryo-protection, crys- 
tals were transferred to mother liquor supplemented with 
15% (v/v) glycerol and subsequently flash-cooled in liquid 
nitrogen. 

Crystallographic procedures 

Diffraction data were collected at beamline BL14.2 of the 
BESSY II storage ring (Berlin, Germany; Supplementary 
Table S2). Data were processed with the XDS package (1 1). 
The structure of />/wThermoDBP-RPl form 1 crystals was 
solved by the multiple-wavelength anomalous dispersion 
method using the program SHARP/autoSHARP (12). The 
structure of the second crystal form of /?/wThermoDBP- 
RPl was solved by molecular replacement using the 
structure coordinates of the crystal form 1 as a search 
model with the program MOLREP (13). The struc- 
ture of a/7^ThermoDBP-RP2 was solved via the single- 
wavelength anomalous dispersion method using the pro- 
gram HKL2MAP (14,15). Finally, the crystal structure 
of an a/7^ThermoDBP-RP2-dTio complex was solved by 
molecular replacement using the structure coordinates of an 
a/?^ThermoDBP-RP2 tetramer as a search model with the 
program PHASER (16). Model building was done manu- 
ally with COOT (17). For all the structures, refinement was 
done with phenix. refine (18,19) or Refmac5 (20) including 
TLS refinement (21). Intermediate and final structures were 
evaluated with MOLPROBITY (22). All structure figures 
were drawn using PYMOL (23). Electrostatic surfaces were 
calculated with APBS (24). 

RESULTS 

Nucleic acid binding by ThermoDBP-RPs 

To investigate the possible cellular functions of 
ThermoDBP-RPs, we first tested the reported interac- 
tions of the proteins with box C/D sRNAs and ribosomes 
or ribosomal subunits in vitro. In electrophoretic mo- 
bility shift assays (EMS As), /?/wThermoDBP-RPl and 
a/>^ThermoDBP-RP2 bound to box C/D sRNAs but the 
interactions were efficiently competed by the addition 
of unlabeled Escherichia coli tRNAs (Supplementary 
Figure S2), suggesting that the proteins do not recognize 
sequence or structural features of box C/D sRNAs with 
high specificity. At low salt concentrations (20-120 mM 
NaCl), the proteins were pelleted by 30S and 50S ribosomal 
subunits and by 70S ribosomes (Supplementary Figure 
S3A; left and middle panels; lanes 3, 6 and 9). Increasing 
the salt concentration to 200 mM abrogated binding to the 
subunits and ribosomes, suggesting that these interactions 
are only marginally stable (Supplementary Figure S3B; left 
and middle panels; lanes 3, 6 and 9). 



Nucleic Acids Research, 2014, Vol 42, No. 10 6701 



While these resuhs do not rule out the possibility that 
ThermoDBP-RP proteins interact with box C/D sRNAs 
or ribosomes in vivo, they prompted us to investigate if 
the proteins interact also with other nucleic acids. In- 
deed, EMS As (Figure lA and B) and isothermal titration 
calorimetry (ITC; Supplementary Figure S4) showed that 
ap^ThermoDBP-RP2 bound to a 21-mer ssDNA of mixed 
sequence or to 21-mer homo-pyrimidine ssDNAs with ap- 
parent Kd values of about 50-100 nM but very weakly or 
not at all to homo-purine ssDNAs, DNA duplexes, ssRNAs 
or an RNA duplex of the same length. A longer (45-mer) ss- 
DNA was gradually shifted in EMS As (Figure IC), indicat- 
ing a stepwise binding of more than one a/7^ThermoDBP- 
RP2 molecule or complex. Under the same experimental 
conditions, /?/wThermoDBP-RPl did not bind to any of the 
21-mer nucleic acids (Figure lA and B) but weakly bound 
to a 45-mer ssDNA (about 50% shift at 50-100 |xM pro- 
tein concentration; Figure IC). Increasing the temperature 
or time of incubation had no effect on any of these inter- 
actions. These data suggest that at least ThermoDBP-RP2 
proteins can act as SSBs. 



Crystal structures and modes of oligomerization of 
/7/MThermoDBP-RPl and a/?^ThermoDBP-RP2 

We next determined the crystal structures of 
;?/wThermoDBP-RPl and a/?^ThermoDBP-RP2 at 2.43 
and 2.05 A resolution, respectively (Figure lA, B and Sup- 
plementary Table S2). As expected from sequence analysis 
(Supplementary Figure Sl),/>/wThermoDBP-RPl contains 
an NTD similar to the rr^ThermoDBP NTD (Figure 
2C; root-mean-square deviation (r.m.s.d.) of 2.0 A for 81 
pairs of Ca atoms). Compared to the /^/wThermoDBP- 
RPl NTD, the f^^ThermoDBP NTD is expanded by 
two additional N-terminal (3 -strands and an extra helix 
between />/wThermoDBP-RPl helices al and a2 (Figure 
2C and Supplementary Figure SI). The C- terminal part 
of />/wThermoDBP-RPl forms an amphipathic a-helix. 
The C-terminal a3 -helices of four />/wThermoDBP-RPl 
molecules associate as an antiparallel, four-helix bundle, 
positioning two pairs of NTDs at the opposite ends of a 
central four-helix rod (Figure 2A). About 14 000 A^ of sol- 
vent accessible surface area is buried upon tetramerization, 
and an interface analysis (25) suggested that the observed 
tetramer is stable in solution. In agreement with this 
analysis, /?/wThermoDBP-RPl eluted from an analytical 
size-exclusion column consistent with the size of a tetramer 
(apparent molecular mass 65.5 kDa; calculated tetramer 
molecular mass 68.9 kDa). 

a/?^ThermoDBP-RP2 contains two globular domains, 
an NTD and a C-terminal domain (CTD) that are con- 
nected by a bent, a-helical linker (helix a3; Figure 2B). The 
NTD is again similar to those of ^^^ThermoDBP and of 
;?/wThermoDBP-RPl (r.m.s.d. of 2.1 A and 0.8 A, respec- 
tively, for 87 Ca atoms; Figure 2C). The a/?eThermoDBP- 
RP2 CTD is composed of a five-stranded, anti-parallel 
P-sheet sandwiched between two a-hehces (Figure 2B). 
a/7^ThermoDBP-RP2 also crystallized as a tetramer, which 
was arranged as a dimer of tightly intertwined dimers 
(Figure 2B). Analytical gel filtration analyses indicated 



that a/?^ThermoDBP-RP2 also forms tetramers in solution 
(apparent molecular mass 111.3 kDa; calculated tetramer 
molecular mass 105.4 kDa). Within a tight dimer (sub- 
units A/A^ and B/B^; Figure 2B), each molecule cradles the 
NTD of the partner molecule between its own CTD and 
the connecting helix a 3 (interface I; Figure 2B), thereby 
burying ^3300 A^ of solvent accessible surface area. Each 
molecule of a tight dimer interacts via its own NTD with the 
NTD of one molecule of a neighboring tight dimer (bury- 
ing ~ 1 300 A^ of solvent accessible surface area; interface II; 
Figure 2B), and via its own CTD with the CTD of the other 
molecule of the neighboring tight dimer (burying ~ 1 600 A^ 
of solvent accessible surface area; interface III; Figure 2B). 
Only interface I within a tight dimer is predicted to be sta- 
ble in isolation, while isolated interfaces II and III (between 
tight dimers) are predicted to be unstable (25). Consistently, 
interface I is largely hydrophobic (0.24 salt bridges/ 100 A^) 
while interfaces II and III are more hydrophilic (1.5 and 0.38 
salt bridges/ 100 A^, respectively). Based the interface anal- 
yses, we expect that /?/wThermoDBP-RPl forms permanent 
tetramers but that interfaces II and III in a/?^ThermoDBP- 
RP2 may intermittently break either partly (forming open 
tetramers) or entirely (forming isolated tight dimers). 

Crystal structure of a ThermoDBP-RP2-DNA complex 

The (2/?^ThermoDBP-RP2 tetramer in the crystal exhibits 
largely electronegative outer surfaces and an interior tun- 
nel system that is lined with electropositive surface poten- 
tial (Figure 2D), possibly suitable for ssDNA binding. To 
reveal how a/?^ThermoDBP-RP2 binds ssDNA, we deter- 
mined its crystal structure in complex with a decameric 
oligo-dT DNA (dTio) at 2.9 A resolution (Figure 3A, 
B and Supplementary Table S2). The crystals contained 
two crystallographically independent ap^ThermoDBP-RP2 
tetramers per asymmetric unit, each associated with two 
dTio molecules (Figure 3A). We will focus our description 
on one of these virtually identical complexes with better de- 
fined electron density. 

All 10 nucleotides of the first DNA strand (DNAl) could 
be traced, whereas only seven nucleotides of the second 
DNA strand (DNA2) were visible in the electron density. 
As shown in Figure 3A-C, the two DNA molecules mean- 
der in the 5^-to-3^ direction from the bottom of the com- 
plex into the interior of the a/?^ThermoDBP-RP2 tetramer, 
where they are cradled between the juxtaposed NTDs of the 
four subunits. The central portions of both DNA molecules 
are bound in the same orientation across equivalent sur- 
faces of two non-contacting NTDs of two tight dimers; i.e. 
DNAl lies across the NTD of the A subunit (NTD^), while 
DNA2 binds across NTD^' (Figure 3B). Superimposing the 
subunit B^-DNA2 sub-complex on the subunit A-DNAl 
sub-complex revealed that nucleotides 5-9 of DNAl and 
the first five nucleotides visible in DNA2 are bound in a sim- 
ilar fashion across NTD^ and NTD^ , respectively (Figure 
3B). We attributed the same numbers to those nucleotides 
of the two DNA molecules that occupy equivalent positions 
on NTD^ and NTD^^ and engage in similar contacts, irre- 
spective of the total length of the oligos (ten nucleotides). 
Thus, visible nucleotides of DNAl were numbered dTl- 
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Figure 1. Nucleic acid binding properties of ThermoDBP-RP proteins. (A) Electrophoretic gel mobility shift assays testing the nucleic acid 
binding capabilities and preferences of />/wThermoDBP-RPl and (2/7eThermoDBP-RP2. ssDNA-d(ACTGCTAGAGATTTTCCACAT); ssRNA- 
r(ACTGCTAGAGATTTTCCACAT); dsDNA and dsRNA additionally contained the corresponding complementary strands. (B) Binding of 
/7/wThermoDBP-RPl and a/7eThermoDBP-RP2 to a 21-mer homo-pyrimidine (dT2i) DNA. (C) Binding of/j/wThermoDBP-RPl and opeThermoDBP- 
RP2 to a 45-mer ssDNA- 
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Figure 2. Structures and oligomerization of ThermoDBP-RP proteins. (A) Ribbon plot of the /)/wThermoDBP-RPl tetramer (subunits A-D). Subunits 
A-D are colored in beige, brown, black and gray (or shaded differently). The NTD and the C-terminal a3 helices forming an antiparallel coiled-coil are 
labeled. (B) Ribbon plot of the a/7eThermoDBP-RP2 tetramer. Monomers are colored/shaded as in A. Subunits A/A' and B/B' form the tight dimers 
that associate into tetramers. NTDs, connecting helices a3 and CTDs are labeled. Roman numerals (I-III) denote the different interfaces through which 
the tetramer forms. (C) Comparison of the NTDs of pfuThermoDBP-RPl (top), apeThermoDBP-RP2 (center) and tteThermoDBP (bottom). Secondary 
structure elements are labeled. (D)Electrostatic surface representation of an apeThermoDBP-RP2 tetramer (electrostatic coloring/shading scaled in the 
range of ±5 kT/e). Entries/exits of an intersubunit tunnel system, which is lined with electropositive surface potential, are indicated. 



dTlO while visible nucleotides of DNA2 were numbered 
dT5^-dTir, with nucleotides dT5-dT9 of DNAl and nu- 
cleotides dT5^-dT9^ of DNA2 being bound in an equiv- 
alent fashion on NTD^ and NTD^ , respectively (Figure 
3B and Supplementary Figure S6). The two DNAs meet 
in the center of the (2/7^ThermoDBP-RP2 tetramer, with 
the 3^-terminal nucleobase (dTlO) of DNAl stacking on 
the fifth nucleobase (dT90 of DNA2 (Figure 3B and Sup- 



plementary Figure S6). The 3 ^-terminal two nucleotides of 
DNA2 (dTlO' and dTllO are deflected from the center of 
the tetramer towards the upper part of the complex (Figure 
3B and C). 

We observed contacts of the protein subunits to the 
sugar-phosphate backbone as well as to the nucleobases 
of the DNA molecules (Supplementary Table S3; Figure 
3B and Supplementary Figure S6). In addition to the in- 
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Figure 3. Binding of ssDNA by «/7eThermoDBP-RP2. (A) Ribbon plot of the a/7eThermoDBP-RP2-dTio complex with the DNAs in cartoon representa- 
tion (DNAl: green/dark; DNA2: yellow/light). Monomers are labeled and colored/shaded as in Figure 2B. (B) Close-up view of two neighboring NTDs 
of the a/7eThermoDBP-RP2 (from subunits A and B'; ribbons) in contact with two dTio molecules (DNAl and DNA2 with nucleotides dTl-dTlO and 
dT5'-dTl 1', respectively; ball-and-stick; coloring/shading as in A). The two DNA strands run across the al helices of the respective NTDs. The rotation 
symbol in this and the following panels indicates the orientation relative to (A). (C) Views on the DNA-binding surfaces of the two tight dimers (A/ A' left; 
B/B' right) with the associated DNAl and DNA2. Relative orientations of the two tight dimers with respect to the tetramer are indicated by the rotation 
symbols. Two surface representations are shown for each tight dimer. Representations on the left show coloring/shading by molecules (as in Figure 2B). 
Black and gray patches on tight dimer A/ A' indicate contact residues to subunits B' and B, respectively, of their neighboring tight dimer (belonging to 
interfaces II or III as labeled); beige and brown/light and dark patches on tight dimer B/B' indicate contact residues to subunits A and A', respectively, 
of their neighboring tight dimer. Representations on the right show surfaces of the tight dimers colored/shaded by electrostatic surface potential (as in 
Figure 2A). DNAl is proposed to outline part of the path that an elongated ssDNA molecule could take through the tetramer. This molecule could ei- 
ther traverse the tetramer diagonally (green/dark dashed extensions; path 1), or turn back along the path that DNA2 is bound in opposite orientation 
(yellow/light dashed extensions; path 2). Both putative paths are lined with electropositive surface potential. (D) Electrophoretic gel mobility shift assay 
testing the binding of increasing amounts of fl/7eThermoDBP-RP2 to ohgo-dT ssDNAs of increasing length. Interpretation of bands is given on the right. 
dTi5, dT2o, dT25 and dTso show only one shifted band, a second, slower migrating band appears with dT3o at higher a/7eThermoDBP-RP2 concentrations, 
indicating that a ssDNA molecule of 35 nucleotides can accommodate two fl/7eThermoDBP-RP2 tetramers. (E) Electrophoretic gel mobility shift assay 
testing the binding of (2/?eThermoDBP-RP2 to circular Ml 3 phage ssDNA on a 1% agarose gel. DNA was visualized by ethidium bromide staining. Upon 
addition of increasing concentrations of the protein (indicated below the gel), the DNA migrates progressively slower, indicating that increasing numbers 
of a/7eThermoDBP-RP2 molecules bind to the circularly closed DNA. (F) Model for binding of long ssDNAs to an <3/7eThermoDBP-RP2 tetramer along 
pathl (top) and path 2 (bottom). Left panels: Views on the ssDNAs running across the a/7eThermoDBP-RP2 A/ A' tight dimer with the other dimers 
shown as a semi-transparent outline. Right panels: Same views with the B/B' tight dimer as solid surfaces. 
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teractions of the central portions of DNAl and DNA2 
with NTD^ and NTD^ , respectively, they both engage 
in additional contacts to all other protein subunits of the 
a/7eThermoDBP-RP2 tetramer. These interactions again in- 
volve primarily residues from the NTDs of the other sub- 
units (Supplementary Table S3 and Supplementary Fig- 
ure S6). Thus, the NTD of one protein subunit provides 
the main binding platform for one ssDNA molecule and 
NTDs from the other subunits engage in different contacts 
to this DNA molecule and thereby guide it asymmetrically 
through the symmetric protein tetramer. 

In DNAl, dTl and dT2 stack on each other and on 
N211 of helix a6 from CTD^ (the only CTD-DNA inter- 
action seen in the entire structure). The side chain of dT3 
is rotated outwards, introducing a first kink in the DNA 
backbone. The kinked backbone is stabilized by contacts 
to K63 and L64, originating from the loop following he- 
lix a2 of NTD^'. The following dT4, dT5 and dT6 again 
form a continuous stack that is capped by F23 from he- 
lix al of NTD^. This helix is wedged between nucleotides 
dT6 and dT7, introducing a second kink in the nucleic acid. 
dT8 stacks on dT7 and the two nucleotides are sandwiched 
between the central portions of the NTD^ and NTD^ al 
helices. NTD^ helix al imposes a third kink and guides 
the 3^-terminal two-nucleotide stack (dT9-dT10) towards 
the center of the a/?^ThermoDBP-RP2 tetramer, where the 
backbone of dTlO is held between K17 and R20 from he- 
lix al of NTD^ . The conformation and path through the 
a/?^ThermoDBP-RP2 tetramer of the second DNA strand 
is similar except for its last two nucleotides, dTlO^ and dTl T 
(see above). While the protein imposes a multiply kinked 
structure on the DNA ligands, the a/?^ThermoDBP-RP2 
tetramer itself did not undergo any significant conforma- 
tional changes upon DNA binding (r.m.s.d. ~0.9 A for 880 
Ca-pairs between DNA-free and DNA-bound forms). 

Molecular basis for the nucleic acid-binding preferences of 
a/;^ThermoDBP-RP2 

The kinks in the DNA backbone are incompatible with con- 
tinuous Watson-Crick base pairing in a DNA duplex and 
the size of the central tunnel system in an a/?^ThermoDBP- 
RP2 tetramer could not accommodate duplex DNA (or 
RNA), rendering a/?^ThermoDBP-RP2 specific for single- 
stranded nucleic acids. Purines could be accommodated at 
most positions but a purine at positions 9/9' would clash 
with the al helices of NTD^/NTD^ , respectively, explain- 
ing why (2/7^ThermoDBP-RP2 can bind ssDNA of mixed 
sequence but fails to bind homo-purine sequences (Supple- 
mentary Figure S4). Alternatively or additionally, homo- 
purine sequences may be excluded because of their tendency 
to form higher-order structures, such as dG-quadruplexes 
(26) or poly-dA parallel helices (27), which could not be 
accommodated in the restricted internal tunnel system of 
the ap^ThermoDBP-RP2 tetramer. Although the protein 
interacts with several nucleic acid bases, none of these con- 
tacts would be expected to further restrict the nucleic acid 
sequence that can be bound. Furthermore, a 2^-hydroxyl 
group on nucleotides 2/2^ would clash with the base of 
nucleotides 4/4^, a 2^-hydroxyl group on nucleotides 4/4^ 
would sterically interfere with the ribose of nucleotides 5/5^ 



and a 2^-hydroxyl on nucleotides 6/6' would clash with the 
backbone of Arg20 of NTD^, explaining the preference of 
(2/?^ThermoDBP-RP2 for ssDNA over ssRNA. 

Extent of an a/;^ThermoDBP-RP2 binding site and engage- 
ment of ssDNA lacking free termini 

The precise stoichiometry and ssDNA binding mode seen 
in the present crystal structure may be a consequence 
of the short DNA oligomers and high DNA concen- 
trations used during the crystallization experiment. To 
estimate the number of nucleotides occupied by one 
ap^ThermoDBP-RP2 tetramer on ssDNA in solution, 
we conducted EMSA experiments with increasing length 
oligo-dT ssDNAs (Figure 3D). Up to a length of 30 
nucleotides only a single shifted band was detected, in- 
dicating binding of one (2p^ThermoDBP-RP2 tetramer 
(Figure 3D). However, starting with a length of 35 nu- 
cleotides, a second, slower migrating band appeared at 
higher a/7^ThermoDBP-RP2 concentrations (Figure 3D), 
indicative of two a/?^ThermoDBP-RP2 tetramers being ac- 
commodated on a dT35 ssDNA. These results suggest that 
one (2/7^ThermoDBP-RP2 tetramer occupies a stretch of 17 
to 18 nucleotides on a ssDNA target, very similar to the 
combined number of nucleotides from DNAl and DNA2 
that we see bound to an ap^ThermoDBP-RP2 tetramer in 
the crystal structure (17 nucleotides). 

Physiologically, a/?eThermoDBP-RP2 may engage ex- 
tended ssDNA stretches that lack free ends. While we 
also obtained crystals of a/?^ThermoDBP-RP2 with longer 
DNA oligomers, they diffracted poorly (about 5 A resolu- 
tion) and the electron densities for the DNA ligand(s) were 
poorly defined. To test whether a/?^ThermoDBP-RP2 is ca- 
pable of binding ssDNA molecules that lack free termini, 
we performed EMSAs with single-stranded circular Ml 3 
phage DNA (Figure 3E). Upon addition of increasing con- 
centrations of a/?^ThermoDBP-RP2, the ssDNA was pro- 
gressively retarded on the gels, indicative of the protein oc- 
cupying multiple binding sites in a stepwise manner. Due 
to the large size of the Ml 3 DNA (7249 nucleotides) and 
the limited number of nucleotides required to accommo- 
date one a/?^ThermoDBP-RP2 tetramer, it is likely that at 
each increment in protein concentration multiple additional 
(2/?^ThermoDBP-RP2 tetramers bind. 

DNA-binding may be auto-inhibited in the /?/wThermoDBP- 
RPl tetramer 

Residues of a/?eThermoDBP-RP2 that contact the DNA 
molecules in the present structure are conserved in 
/?/wThermoDBP-RPl (Supplementary Figure SI). To inves- 
tigate why the latter protein nevertheless binds poorly to ss- 
DNA (Figure IB and C), we superimposed the NTD^ do- 
main of a/?^ThermoDBP-RP2 in complex with the DNAl 
molecule on an NTD of the /?/wThermoDBP-RPl tetramer 
(Supplementary Figure S5A, left panel). The superposition 
revealed that in /?/wThermoDBP-RPl, the putative DNA- 
binding surface on the NTD is occluded by the C-terminal 
tail of another /7/wThermoDBP-RPl subunit (Supplemen- 
tary Figure S5A; close-up in left panel). Residues 128- 
139 of subunit B clash with a DNA molecule modeled on 
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the NTD of subunit A. Thus, the present crystal structure 
of /?/wThermoDBP-RPl may represent an auto-inhibitory 
conformation with respect to ssDNA binding. We at- 
tempted to test whether the NTD of;7/wThermoDBP-RPl 
alone interacts more strongly with nucleic acids but failed to 
produce the protein in soluble form. The weak affinity for 
longer DNA oligonucleotides seen with /7/i/ThermoDBP- 
RPl (Figure IC) may indicate that the protein can bind ss- 
DNA only in special contexts, where the presumed auto- 
inhibition is relieved, for example by other interacting pro- 
teins. 

DISCUSSION 

ThermoDBP-RP2 proteins can act as non-canonical SSBs 

We have demonstrated that the NTD, conserved among 
ThermoDBPs and ThermoDBP-RPs, is a versatile ssDNA- 
binding domain in ThermoDBP-RP2 proteins in vitro. ss- 
DNA can come to lie on various sites and in different ori- 
entations on multiple copies of this domain in an oligomeric 
protein. While these results show that ThermoDBP-RP2 
proteins can function as SSBs in vitro, a similar activity in 
vivo remains to be demonstrated. 

The fold and mode of ssDNA binding of ThermoDBP- 
RP2 differ markedly from previously characterized SSBs 
(3). Canonical SSBs typically bind ssDNA or ssRNA on 
their outer surfaces with the bases pointing towards the pro- 
tein and the backbone solvent exposed. In the case of bac- 
teriophage T4 gene 32 protein, ssDNA binds along a deep 
channel formed between two subdomains (28). In contrast, 
a ThermoDBP-RP2 tetramer binds ssDNA inside an inter- 
nal, intersubunit tunnel system. This unprecedented bind- 
ing mode may be particularly useful to protect ssDNA in 
challenging environments such as high temperatures. 

Molecular mechanism of ssDNA engagement 

Our crystal structure of an ap^ThermoDBP-RP2-DNA 
complex shows directly that the protein can bind short or 
nicked ssDNAs. Such a function could be required to in- 
termittently hold on to DNA ends during recombination 
or repair events. It is even imaginable that such an activity 
could be used to detect linear, foreign DNA molecules as 
part of a simple immune system, but there is no direct evi- 
dence supporting this latter idea. 

On the other hand, we could show that a/?eThermoDBP- 
RP2 can bind efficiently to a circular ssDNA molecule that 
lacks free termini, suggesting that it could also substitute 
for canonical SSBs in general ssDNA-binding functions. We 
suggest that the short ohgomers bound in our co-crystal 
structure also mark the possible paths that a single, long 
ssDNA molecule would take through the tetramer. The 
DNAl molecule most likely outlines a major part of the 
binding site, as it represents that largest coherent stretch 
of DNA that could be traced in the electron density. The 
5^ -end of that molecule emerges from the internal tunnel 
system and lies on the surface of the a/?^ThermoDBP- 
RP2 tetramer. Additional nucleotides on this side of the 
DNA most likely simply extend into the solvent. Addi- 
tional nucleotides at 3^-end would come to lie in positions 
that in our crystal structure are occupied by the DNA2 



molecule. Topologically, the 3^-end of DNAl could be di- 
rectly connected to the 3^-terminal two nucleotides (dTlO^ 
and dTl 1^ of the DNA2 molecule. Thus, these terminal two 
nucleotides of DNA2 point out a possible exit path for an 
extended ssDNA molecule between the CTDs of subunits 
A and B and the NTDs of subunits A and B (path 1; green 
dashed line or tube in Figure 3C and F). Alternatively, an 
extended DNAl molecule could bind along the path out- 
lined by nucleotides dT9^-dT5^ of the DNA2 molecule but 
in opposite direction (path 2; yellow dashed fine or tube 
in Figure 3C and F). Although in this scenario it would 
traverse the same surfaces of NTD^ and NTD^ in oppo- 
site orientation as seen for DNA2 in the present structure, 
such a binding mode is not entirely unlikely given the ver- 
satile contacts that are seen between different NTDs and 
portions of the bound DNA molecules in our crystal struc- 
ture. It is also supported by the quasi-continuous stacking 
interaction observed between the 3^-terminal nucleotide of 
DNAl (dTlO) and an internal nucleotide (dT90 of DNA2. 
Both putative paths for extended ssDNA molecules through 
the ap^ThermoDBP-RP2 tetramer are lined with positive 
electrostatic surface potential on the protein (Figure 3C). 
However, due to ensuing steric hindrance, it would be im- 
possible to bind two extended ssDNA molecules to an 
a/?^ThermoDBP-RP2 tetramer at the same time (e.g. along 
both of the two putative paths through the internal tunnel 
system). 

Another immediate question is how an a/^^ThermoDBP- 
RP2 tetramer can engage a ssDNA stretch that lacks free 3^- 
ends and thus could not be threaded end-on into the tunnel 
system of the tetramer. Given the predicted lability of in- 
terfaces II and III of the a/?^ThermoDBP-RP2 tetramer, we 
envision that the protein complex can open up along these 
interfaces to engage extended ssDNA molecules at its cen- 
ter. The complex would not have to completely dissociate 
into tight dimers; opening of one interface III at the top or 
bottom of the tetramer would suffice to allow accommoda- 
tion of an extended ssDNA molecule at the internal tunnel 
system. 

Proteins bearing ThermoDBP-RP2-like NTDs 

Although the ThermoDBP-RPl NTD closely resembles the 
NTD of ThermoDBP-RP2 proteins, the physiological func- 
tion of ThermoDBP-RPl proteins remains unclear. Our in 
vitro binding studies do not support, but also do not ul- 
timately rule out, a specific ribosome or box C/D sRNA 
association of these proteins, as had been reported previ- 
ously (8). Weak DNA binding and the observation that 
the equivalent of the main ssDNA-binding surface of the 
ThermoDBP-RP2 NTD is obstructed in the tetrameric or- 
ganization of /?/wThermoDBP-RPl suggest that the pro- 
teins may require special activatory mechanisms to engage 
ssDNA or other nucleic acids. Such activation (or relief of 
auto-inhibition) may be provided by other interacting pro- 
teins that force the /?/wThermoDBP-RPl NTDs into a dif- 
ferent relative arrangement with respect to the central four- 
helix bundle. 

Gel filtration and sequence analyses suggested that Ther- 
moDBPs form parallel dimers via a C-terminal leucine zip- 
per motif (7). If these putative, parallel ThermoDBP dimers 
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resemble a parallel dimer of our present />/wThermoDBP- 
RPl structure (e.g. subunits A and D; Figure 2 A and Sup- 
plementary Figure S5A), the main TliermoDBP-RP2-like 
ssDNA binding surface of the ThermoDBP NTDs would 
remain unobstructed (Supplementary Figure S5A; right 
panel). However, most of the ssDNA-contacting residues 
of the a/?^ThermoDBP-RP2 NTDs are poorly conserved 
in ThermoDBPs (Supplementary Figure SI), suggesting 
that ssDNA could also be bound via a different surface 
in the latter proteins. The NTDs in the presumed paral- 
lel ThermoDBP dimers also would not cluster as seen in 
the (2p^ThermoDBP-RP2 tetramer and must bind ssDNA 
non-cooperatively or cooperate in a different fashion in ss- 
DNA binding than observed in a/?^ThermoDBP-RP2. Ir- 
respective of the exact mode of ssDNA binding in Ther- 
moDBPs, the observation of strong ssDNA binding by full- 
length rr^ThermoDBP (7) indicates that ThermoDBPs are 
not auto-inhibited. 

A search of the Protein Data Bank with the Dali server 
(29) using the NTD of a/?^ThermoDBP-RP2 revealed its 
similarity to a C-terminal dsDNA-binding domain in the 
eukaryotic DNA polymerase eta (r.m.s.d. of 1.75 A for 97 
Ca atoms; Supplementary Figure S5B). Unlike the binding 
of the a/?^ThermoDBP-RP2 NTDs to ssDNA, the DNA 
polymerase eta dsDNA-binding domain employs its (3- 
sheet surface to engage dsDNA along its major groove. To- 
gether, our analyses show that the ThermoDBP-RP2-like 
NTD provides a structural scaffold that can be adapted in 
diverse ways for ssDNA binding (as in the cases of Ther- 
moDBPs and ThermoDBP-RP2s) or for dsDNA binding 
(as in the case of the C-terminal domain of DNA poly- 
merase eta). 
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